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Abstract 

Applying the enumeration of sparse set partitions, we show that the 
number of set systems H C exp({l, 2, . . . , re}) such that ^ H, J2egh l-^l = 
re and \Je<eh -£^ = {1,2,..., m}, m < n, equals (1/ log(2) + o(l)) n 6 n where b n 
is the n-th Bell number. The same asymptotics holds if H may be a multiset. 
If vertex degrees in H are restricted to be at most k, the asymptotics is 
+ o(l)) n 6 n where is the unique root of J2i=i x i /i\ — 1 in (0, 1]. 



1 Introduction 



Let h' n , where n 6 N = {1,2,...}, be the number of simple set systems H such that 
the edges E G H are nonempty finite subsets of N, the weight w(H) = J2e<eh \E\ 
of H equals n, and the vertex set is (normalized to be) [m] = {1,2, .. . , m} for 
some m. Let h" n be the number of these set systems with weight n if edges may 
be repeated, i.e., if H may be a multiset. For example, Hi = {{1}, {2}, {3}}, 
H 2 = {{1,2}, {3}}, H 3 = {{1,3}, {2}}, H 4 = {{2,3}, {1}}, H 5 = {{1,2}, {1}}, 
H e = {{1,2}, {2}}, and H 7 = {{1,2,3}} show that h' 3 = 7. The three additional 
multisets H 8 = {{1}, {1}, {2}}, H g = {{1}, {2}, {2}}, and H w = {{1}, {1}, {1}} 
show that h'l = 10. We investigated h' n and h" n in [TT] where we calculated the 
values 
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and proved that b n < h' n < h" n < 2 n ~ 1 b n , b n being the n-th Bell number (only 
the last inequality is nontrivial). In this note we prove the following stronger 
asymptotic bound. 

Theorem. Ifn — > oo, h' n and h" n have the asymptotics 

((log2)- 1 + o(l)) n • b n = (1.44269 . . . + o(l)) n ■ b n 
where b n are Bell numbers. 
The first Bell numbers are 



n || 1 | 


2 


3 


4 


5 


6 


7 j 


8 ] 


9 


10 


b n || 1 | 


2 


5 


15 


52 


203 


877 


4140 


21147 


115975 



and their asymptotics is recalled in Proposition ESI In [TT] we investigated besides 
h' n and h" n the numbers h n of simple set systems H with vertex set [n]; h n were 
considered before by Comtet [HI p. 165], Hearne and Wagner and Macula [13] 
and they constitute entry A003465 of Sloane [T7j . 

In Section 2 we prove the theorem. In Section 3 we give concluding comments, 
point out some analogies and pose some open questions. Now we recall and fix 
notation. For n G N, {1, 2, . . . , n} is denoted [n]. For A, B C N, A < B means that 
x < y for every x G A and y G B. We use notation f(n) <C g(n) as synonymous 
to the f(n) = 0(g(n)) notation. The coefficient of x n in a power series F(x) is 
denoted [x n ]F. A set system H is here a finite multisubset of exp(N) whose edges 
E G H are nonempty and finite. The vertex set is V(H) = UEeH E. The degree 
deg(x) = deg^(x) of a vertex x G V(H) in H is the number of edges containing 
x. If there are no multiple edges, we say that H is simple, if is a partition if 
its edges are mutually disjoint; in the case of partitions they are usually called 
blocks. The number of partitions H with V(H) = [n] is the Bell number b n . 
The weight of a set system H is w(H) = J2 v ev{H) deg(v) = J2ezh\E\- H ls 
normalized if V(H) = [m] for some m. In the proof of Proposition 12.31 we work 
with more general set systems H with vertex set contained in the dense linear 
order of fractions Q. We normalize such set system by replacing it by the set 
system H' = {f(E) : E G H}, V(H') = [m], where / : V(H) — > [m] is the unique 
increasing bijection. 

2 The proof 

To estimate h' n and h" n in terms of b n , we transform a set system H in a set partition 
with the same weight by splitting each vertex v G V(H) in deg H (v) new vertices 
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which are 1-1 distributed among the edges containing v. The following definitions 
and Propositions 12.11 and 12.21 make this idea precise. 

We call two set partitions P and Q of [n] orthogonal if \AC\B\ < 1 for every two 
blocks A G P and B G Q. Q is an interval partition of [n] if every block of Q is a 
subinterval of [n\. For n G N we define W(n) to be the set of all pairs (Q, P) such 
that Q and P are orthogonal set partitions of [n] and Q is moreover an interval 
partition. We define a binary relation ~ on W(n) by setting (Qi,Px) ~ {Q21P2) 
iff Q\ = Q 2 and there is a bijection / : Pi — > P2 such that for every A G Pi the 
blocks A and /(A) intersect the same intervals of the partition Qi = Q 2 . It is an 
equivalence relation. 

Proposition 2.1 For every n G N, there is a bijection (Q, P) 1— > H(Q, P) between 
the set of equivalence classes W(n)/ ~ and the set L(n) of normalized set systems 
H with weight n. In particular, h" n = \L(n)\ = \W(n)/ ~ |. 

Proof. We transform every (Q,P) G W(n), where Q consists of the intervals 
Ji < I 2 < ■ ■ ■ < I m , in the set system H = H(Q, P) = (E A : A G P) where 
E A = {i G [m] : AnJ^ 0}. We have w(P) = n and V{H) = [m], so H G L(n). 
It is easy to see that equivalent pairs produce the same H and nonequivalent pairs 
produce distinct elements of L(n). 

Let H G L{n) with V(f/') = [a]. We split [n] in a intervals Ii < I 2 < ■ ■ ■ < I a 
so that = deg H (i). For every i G [a] we fix arbitrary bijection fa : {P G H : 
i G E} -> Ii. We define the partitions Q = (h, I 2 , . . . , I a ) and P = (v4 s : E e H) 
where Ae = {fi(E) : i G P}. Clearly, (Q,P) G VF(n) and different choices of 
bijections fa lead to equivalent pairs. Also, H(Q, P) = H. Thus (Q, P) 1— > P(Q, P) 
is a bijection between W(n)/ ~ and L(n). □ 

The next proposition summarizes useful properties of the equivalence ~ and 
the bijection (Q,P) 1— > H(Q,P). They follow in a straightforward way from the 
construction and we omit the proof. 

Proposition 2.2 Let (Q,P) G Q = (h < h < ■ ■ ■ < I m ), and H = 

H(Q,P) (so V(H) = [m]). Then deg H (i) = for every i G [m]. The equivalence 
class containing (Q,P) has at most ■ \I 2 \\ ■ . . . ■ |/ m |! pairs. It has exactly so 
many pairs if and only if H is simple. 

Proposition 2.3 For every n G N, h' n < h!' n < 2h' n . 

Proof. The first inequality is trivial. To prove the second inequality, we construct 
an injection from the set N(n) of normalized non-simple set systems H with weight 
n in the set M(n) of normalized simple set system H with weight n. Then h" n = 
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\M(n) \ + \N(n)\ < 2\M{n)\ = 2h' n . We say that a vertex v G V(H) is regular if 
deg(f ) > 2 or if v G E for some E G H with \E\ > 2, else we call v singular. Thus 
v is singular iff {v} G if and deg(t>) = 1. 

Let if G N(n). We distinguish two cases. The first case is when every multiple 
edge of if is a singleton. Then let k > 2 be the maximum multiplicity of an edge in 
if and v = u — 1 where u G V(ff ) is the smallest regular vertex in if; we may have 
v = and then v is not a vertex of if. We have t> < max V(H) and insert between 
v and w new vertices Wi, i = 1, 2, . . . , k — 1 and u < u>i < u>2 < • • • < Wk-i < u. 
Then we replace every singleton multiedge {x} with multiplicity m, 2 < m < k, 
(we have x > u) with the new single edge {wi, u> 2 , ■ ■ ■ , ii> m -i> Normalizing the 
resulting set system we get the set system H'. Clearly, if' G M{n). 

The second case is when at least one multiple edge in if is not a singleton. We 
define k, v, u, and Wi, . . . ,Wk-i as in the first case and replace every multiedge 
E with multiplicity m, 2 < m < k, (we have mini? > u) by the new single 
edge {wi, W2, ■ ■ ■ , w m -i} U E. We add between Wk-i and u a new vertex s and 
add a new singleton edge {s}. This singleton edge is a marker discriminating 
between both cases and separating the new vertices Wi from those in E. Since 
m — 1 + \E\ < m\E\ if \E\ > 2 and m > 2, the weight is still at most n. We 
add in the beginning sufficiently many new singleton edges {— r}, . . . , {—1}, {0} so 
that the resulting set system has weight exactly n. Normalizing it, we get the set 
system if'. Again, if' G M(n). Note that in both cases the least regular vertex 
in if' is ui\ and that in both cases the longest interval in V(H') that starts in w\ 
and is a proper subset of an edge ends in Wk~i- 

Given the image if' G M(n), in order to reconstruct if we let w G V(H') be the 
least regular vertex (i.e., w is the first vertex lying in an edge E with \E\ > 2) and 
let i be the longest interval in V(H') that starts in w and is a proper subset of an 
edge. If max f + 1 is a singular vertex of if', it must be s and we are in the second 
case. Else there is no s and we are in the first case. Knowing this and knowing (in 
the second case) which vertices are the dummy Wi, we uniquely reconstruct the 
multiedges of if. Thus if i— > if ' is an injection from N(n) to M (n). □ 

For k, n G N we define h' k ' n to be the number of normalized set systems with 
weight n and maximum vertex degree at most k. The number of simple such 
systems is h' kn . Proposition 12.41 can be proved by an injective argument similar 
to the previous one and we leave the proof to the interested reader as an exercise. 
But note that one cannot use the previous injection without change because it 
creates vertices with high degree. 

Proposition 2.4 For every k,n G N we have h' kn < h kn < 2h' kn . 



4 



For the lower bound on h" n we need to count sparse partitions. A partition P of 
[n] is m-sparse, where m G N, if for every two elements x < y of the same block we 
have y — x > m. Thus every partition is 1-sparse and 2-sparse partitions are those 
with no two consecutive numbers in the same block. If w! < m, every m-sparse 
partition is also m'-sparse. The number of m-sparse partitions of [n] is denoted 
b njTn . The following enumeration of sparse partitions was obtained by Prodinger 
[TE] and Yang [THj, see also Stanley [TBI Problem 1.4.29]. Here we present a simple 
and nice proof due to Chen, Deng and Du [Sj. 

Proposition 2.5 Let m, n G N. For m > n there is only one m-sparse partition 
of [n] . For m < n the number b n>m of m-sparse partitions of [n] equals the Bell 
number 6„_ m+1 . 

Proof. For m > n the only partition in question is that with singleton blocks. 
Let P be a partition of [n\. We represent it by the graph G = ([n],E) where 
for x < y we set {x, y} G E iff x, y G A for some block A of P and there is no 
z G A with x < z < y. The components of G are increasing paths corresponding 
to the blocks of P. Equivalently, G has the property that each vertex has degree 
at most 2 and if it has degree 2, it must lie between its two neighbors. Now 
assume that P is 2-sparse. We transform G in the graph G' — ([n — 1], E') where 
E' = {{x,y — 1} : {x, y} G E,x < y}, i.e., we decrease the second vertex of each 
edge by one. Note that G' is again a graph (no loops arise). The property of G 
is preserved by the transformation and hence the components of G' are increasing 
paths and G' describes a partition P' of [n — 1]. Clearly, P is m-sparse iff P' 
is m — 1-sparse. Thus P i— > P' maps m-sparse partitions of [n] to m — 1-sparse 
partitions of [n — 1]. The inverse mapping is obtained simply by increasing the 
second vertex of each edge (in the graph describing a partition of [n — 1]) by one. 
Thus P i— > P' is a bijection between the mentioned sets. Iterating it, we obtain 
the stated identity. □ 

See jH] for other applications of this bijection. We remark that the representing 
graphs of partitions (but not the transformation of G to G') were used before by 
Biane £Q and Simion a Ullman [16 S . 

We need to compare, for fixed m, the growth of b n and 6 n _ m . The following 
asymptotics of Bell numbers is due to Moser and Wyman . 

Proposition 2.6 For n — ► oo ; 

\{n) n+l / 2 

n ~ n l/2 e n+l-A(n) 

where the function A(n) is defined by A(n) log A(n) = n. 
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It follows by a simple calculation that b n -i/b n ~ log n/n. More generally, we have 
the following. 

Corollary 2.7 If m fixed and n — > oo, 

bn-m ^ ( \ogn \ m 
b n V n J 

In fact, a better approximation is b n -\/b n ~ {\ogn— loglogn)/n. Knuth ^2] gives a 
nice account on the asymptotics of b n and shows that 6 n _i/6 n = (£/n)(l + 0(l/n)) 
where £ • e^ = n. 

We are ready to estimate the numbers of normalized set systems with weight 
n and maximum degree at most k. 

Proposition 2.8 For fixed k G N and n — * oo ; 

where ct^ is the only root of the polynomial J2i=i x % ji\ — 1 in (0, 1]. 

Proof. Let i n ^ be the number of interval partitions Q = (I\ < I2 < ■ ■ ■ < I m ) of 
[n] such that < for all i and Q is weighted by (|/i|! • Ih^- • ■ ■ ■ ■ |/ m |!) _1 . It 
follows that 

1 / 1 \ n 

in,k = [x n \- =^ -jr. ~ c fc — 

with some constant > because is the only root of the denominator in (0, 1] 
and it is simple. Using Propositions 12.11 12.21 and 12.41 we obtain the inequalities 

in,kbn,k — ^k,n — 2i n ,kbn- 

In the first inequality we use the fact that if Q is an interval partition of [n] with 
interval lengths at most k and P is a /c-sparse partition of [n], then Q and P are 
always orthogonal. In the second inequality we neglect orthogonality of the pairs 
(Q, P) but we assume that the corresponding equivalence classes in W(n) have full 
cardinalities |ii|! • | _Z" 2 1 ! - . . . • |/ m |!. By Proposition 12.21 this gives an upper bound 
for h' kn . Using Proposition 12.41 we get an upper bound for h" kn . The explicit 
lower and upper bounds on h'l n now follow from the above asymptotics of i n ^, 
Proposition 12.51 and Corollary 12.71 □ 

Note that l/a 2 = (1 + v / 3)/2. Thus we have roughly ((1 + v / 3)/2) n 6 n = 
(1.36602 . . .) n b n normalized set systems with weight n, in which each vertex lies in 
one or two edges. 
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Theorem 2.9 For n — > oo ; 

^ = (1^2 + ^Y bn = (i- 44269 • • • + 

Proof. Let i n be the number of interval partitions Q of [n], weighted as in the 
previous proof. As in the case of bounded degree, by Propositions 12.11 El 'A and 
12.31 we have the upper bound 

K, < 2i n b n ~ c( r-^—) K 



log 2 



because 



\X =T7 = \X 



and log 2 is a simple zero of 2 — e x . As for the lower bound, h!' n > h!' kn for every 
k, n G N. It is easy to show that a& j log 2 for fc — * oo. Hence, by the lower 
bounds in Proposition 12.81 for any e > we have ft,^ > ((log2) -1 — e) n b n for n big 
enough. □ 



3 Concluding remarks 

P. Cameron investigates in |lj a family of enumerative problems on 0-1 matrices 
including h' n and h" n as particular cases. He defines Fijki(n), i,j,k,l G {0,1}, 
to be the number of rectangular 0-1 matrices with no zero row or column and 
with n l's, where i — 0, resp. i — 1, means that matrices differing only by a 
permutation of rows are identified, resp. are considered as different; j = 0, resp. 
j = 1, means that matrices with two equal rows are forbidden, resp. are allowed; 
and the values of k, I refer to the same (non) restrictions for columns. Notice 
that Fijki(n) is nondecreasing in each of the arguments i,j,k,l. Representing set 
systems by incidence matrices, rows standing for edges and columns for vertices, 
we see that h!' n = F ni(n) and h' n = F on(n). (Another language which can 
be used to deal with these problems, besides set systems and 0-1 matrices, is 
that of bipartite graphs.) In 0] it is shown that -F im (n) ~ Ac n+1 n\ where A = 
|exp(-(log2) 2 /2) w 0.19661 and c = (log2)" 2 w 2.08137. F i i(n) is A049311 of 
[TTj . see also Cameron [3]. P. Cameron asks in [2, Problem 3] if there is an effective 
algorithm to calculate F ioi(n); for h' n and h!' n such algorithms are given in [TT] . 

Interestingly, in the so far derived asymptotics of the functions Fijki(n) the 
constant log 2 w 0.69314 appears quite often. Our theorem says that allowing 
intersecting blocks in "partitions" of [n] is reflected in the counting function by 
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multiplying b n by the exponential factor (log2)~ n . The same phenomenon occurs 
for counting injections and surjections. If i n is the number of injections from [n] to 
N with images normalized to [n] and s n is the number of all mappings ("injections 
with intersections") from [n] to N, again with images normalized to [m] (i.e., s n 
counts surjections from [n] to [m]), then i n = n\ (trivial) and s n ~ c(log2)~ n n! 
where c = (21og2) _1 (a nice exercise on exponential generating functions, see 
Flajolet and Sedgewick i7 4 Chapter 2.3.1]). 

Another parallel can be led between sparse partitions and sparse words. We 
say that a word u = . . .ai over an alphabet A, \A\ = r, is ^-sparse if <2j = aj, 
j > i, implies j — i > k. (We remark that /c-sparse words are basic objects in 
the theory of generalized Davenport-Schinzel sequences, see Klazar ^Dj). The two 
notions of sparseness, in fact, coincide: u = a\02 ■ ■ ■ ai defines a partition P of [I] 
via the equivalence i ~ j <^=^ ai = aj and then, obviously, u is fc-sparse if and 
only if P is fc-sparse. A given partition of [I] can be defined by many words u (even 
if A is fixed). The unique canonical defining words are restricted growth strings, 
see [12] for their properties and more references. (Another term for 2-sparse words 
is Smirnov words which is used, e.g., in jjj or in Goulden and Jackson we prefer 
the former term.) If v n is the number of all words over A (\A\ = r) with length n 
and Sk t n is the number of those which are /c-sparse, then v n = r n (trivial) and 

Sfcn = r (r - 1) . . . (r - k + 2)(r - k + l) n 
r(r-l)...(r-k + 2) f k-l 
(r — k+ l) fc_1 V r 

(by a simple direct counting, for the generating functions approach see [3 Chapter 
3.6.3]). In the case of words over a fixed alphabet sparseness diminishes the count- 
ing function by an exponential factor. Fortunately, for partitions the decrease is 
much smaller, only by a polynomial factor ( Corollary 12. 7|) . 

We conclude by mentioning two natural questions. What is the precise asymp- 
totics of h!' k n and h'£ By Propositions O and E31 1/2 < h! k Jh!' k n < 1 and 
1/2 < h' n /h'n < 1. Does this ratio go to 1 as n — ► oo? 
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